BEHAVIORAL SCIENCE RESEARCH LABORATORY
An activity of the Chief, Research and Development


J. E. UHLANER
Director


JAMES E. WIRRICK
COL, GS
Commanding


NOTICES 


DISTRIBUTION: Primary distribution of this report has been made by BESRL. Please address
correspondence concerning distribution of reports to: U. S. Army Behavioral Science Research
Laboratory, Attn: CRDBSRL, Room 239, Commonwealth Building, 1320 Wilson Blvd., Arlington,
Virginia 22209.


FINAL DISPOSITION: This report may be destroyed when it is no longer needed. Please do not
return it to the U. S. Army Behavioral Science Research Laboratory.


NOTE:  The  findings  in  this  report  are  not  to  be  construed  as  an  official  Department  of  the 
Army  position,  unless  so  designated  by  other  authorized  documents. 


Technical Research Note 207


EFFECTS  OF  SPECTRUM  SAMPLING  ON 
SPEECH  INTELLIGIBILITY 


Anthony  E.  Castelnovo 


James H. Banks, Jr., Task Leader


COMBAT  SYSTEMS  RESEARCH  DIVISION 
Aaron  Hyman,  Chief 


U.  S.  ARMY  BEHAVIORAL  SCIENCE  RESEARCH  LABORATORY 

Office,  Chief  of  Research  and  Development 
Department  of  the  Army 

Room  239,  The  Commonwealth  Building 
1320  Wilson  Boulevard,  Arlington,  Virginia  22209 


March  1969 


Army  Project  Number 
2Q062106A723 


Monitor  Performance-c 


This  document  has  been  approved  for  public  release  and  sale;  its  distribution  is  unlimited. 


BESRL Technical Research Reports and Technical Research Notes are intended for
sponsors of R&D tasks and other research and military agencies. Any findings
ready for implementation at the time of publication are presented in the latter part
of the Brief. Upon completion of a major phase of the task, formal recommendations
for official action normally are conveyed to appropriate military agencies
by briefing or Disposition Form.


FOREWORD 


The MONITOR PERFORMANCE Work Unit employs controlled laboratory experimentation
in developing and testing principles, techniques, and operating procedures to improve the
performance of personnel working in a variety of Army monitoring jobs. The effects of
factors associated with the signal, the monitoring task, the environment, and the individual
are studied simultaneously in various combinations.

"Enhancement of Communication Operator Performance" is a work sub-unit concerned
with variables affecting the intelligibility of audio signals and speech. Present emphasis
is on the area of spectrum selection and binaural listening. The present publication
reports on preliminary experimentation dealing with the effect on speech intelligibility of
excising segments of the spectrum employed in a communication channel. The specific
objective was to compare the effect of filtering out several narrow frequency bands
located over the speech spectrum with that of eliminating a single band covering the same
total extent.


The study was conducted as a part of RDT&E Project 2Q062106A723, "Human Performance
in Military Systems," FY 1969 Work Program.


EFFECTS  OF  SPECTRUM  SAMPLING  ON  SPEECH  INTELLIGIBILITY 


BRIEF 


Requirement: 

To  explore  the  effect  on  speech  intelligibility  of  filtering  out  several  narrow  bands  of 
frequencies  from  various  parts  of  the  speech  spectrum  as  compared  with  excising  the  same 
total  amount  of  the  spectrum  in  a  single  band. 


Procedure: 

Phonetically balanced (PB) stimulus word lists spoken by three different individuals
were presented to 36 subjects through a filter system which permitted variation in the
configuration of pass bands over a 1300-cycle bandwidth. Experimental conditions totaled
54--18 filter configurations each under three noise conditions. Analysis of variance
techniques were applied to the data.


Findings: 

Intelligibility  was  significantly  higher  for  configurations  in  which  the  bandwidths 
excised  were  distributed  over  the  spectrum  than  for  equal  bandwidth  concentrated  in  one 
area. 

Findings  held  for  all  three  noise  conditions. 


Utilization of Findings:

If  these  findings  are  confirmed  for  bandwidths  covering  a  wider  range  of  frequencies, 
voice  radio  communication  could  be  facilitated  in  two  ways:  (1)  Portions  of  the  spectrum 
in  which  unwanted  noise  occurs  could  be  filtered  out,  resulting  in  higher  intelligibility  of 
the  message;  and  (2)  greater  use  could  be  made  of  a  given  communication  channel  by 
sending  more  than  one  message  over  the  channel  at  the  same  time,  each  message  using 
different  portions  of  the  spectrum. 


EFFECTS  OF  SPECTRUM  SAMPLING  ON  SPEECH  INTELLIGIBILITY 


CONTENTS


OBJECTIVE

METHOD

    Spectrum Sampling
    Subjects
    Stimulus Material
    Experimental Procedure
    Experimental Treatment

RESULTS

CONCLUSIONS

LITERATURE CITED

DISTRIBUTION

DD Form 1473 (Document Control Data - R&D)


TABLES

Table 1. Filter specifications--frequency as a function of loss in dB

      2. Comparison of experimentally obtained intelligibilities with those
         computed by use of the Articulation Index at three noise levels at
         given bandwidths

      3. Analysis of variance results

      4. Results of t-test comparison of selection means


FIGURES

Figure 1. Block diagram of equipment used in experiment

       2. Pass bands of the 18 filter configurations, measured at the
          -16 dB point

       3. Relationship between Articulation Index and total bandwidth

       4. Intelligibility and Articulation Indexes for the 18 filter
          configurations under Noise Condition 1

       5. Obtained intelligibility for the 18 filter configurations under
          Noise Condition 2

       6. Obtained intelligibility for the 18 filter configurations under
          Noise Condition 3


EFFECTS  OF  SPECTRUM  SAMPLING  ON  SPEECH  INTELLIGIBILITY 


OBJECTIVE 

One of the more serious problems the operator in a military communication
system faces is that of noise which obscures the message. The
noise may be broad-band noise or appear in specific bands, depending on
the source. If the noise appears in relatively narrow bands, these bands
might be eliminated and, with them, the unwanted noise. However, there are
no firm data on which to estimate the effect of such a procedure on the
operator's performance.

The present study was designed to gain preliminary information about
the effect on performance of excising portions of the speech spectrum.
It is recognized that sophisticated techniques, such as digital transmission
of voice, are being developed and employed to overcome the effects
of noise and to transmit communications in secure form. There are, however,
instances where these techniques are not feasible, and where it may
be useful to reduce the amount of spectrum dealt with by excising selected
bands.

The frequency domain of speech and the relationship of frequency to
intelligibility have been the subject of research by many investigators
(1, 2, 3, 4, 5, 6). These investigators have measured average speech
spectra and have studied the effects on speech intelligibility of excising
continuous bands from the upper and lower areas of the speech spectrum;
for the most part, these studies have involved filtering out a single
portion, or pass band, of the spectrum. On the basis of results of these
studies, communications equipment has been designed to take advantage of
reduced spectrum requirements.

Another way of treating the frequency domain is to filter out pass
bands from several locations in the speech spectrum simultaneously.
Kryter employed a spectrum configuration in which two narrow bands were
eliminated, a configuration which resulted in higher intelligibility than
did elimination of a single band covering the same extent of the spectrum.
In a follow-up study, Kryter (7) observed that for constant speech
intelligibility the total effective bandwidth required for the best multiple
pass-band system is less than that required for contiguous pass-band systems by
a factor of 2. This phenomenon may be explained as a function of redundancy.
Removing some narrow bands reduces redundancy but not necessarily
intelligibility. That redundancy is a characteristic of the speech spectrum
and that some reduction may be made without a corresponding reduction
in intelligibility has been noted before. M. R. Schroeder (8) briefly
reviewed the work of Homer Dudley, noting Dudley's contribution to the
origin of Vocoders, which take advantage of the redundancy of the speech
spectrum.


Apart from Kryter's work, which employed a limited amount of filtering,
there appears to be no other relevant work in the literature. As for
research on the effect of noise on a spectrum composed of discrete bands,
there appears to be none at all. The present study concentrates on the number
and size of segments excised from the speech spectrum as they affect
intelligibility and on the effect of noise on the intelligibility of a speech
spectrum composed of discrete bands. Such information would be useful in
assessing the feasibility of eliminating segments of the spectrum which
may carry particularly high levels of noise and of employing the spectrum
space in the interstices for other uses.


METHOD 


Spectrum  Sampling 

Sampling of the spectrum was accomplished by using a set of 24
electrical pass-band filters which permitted passing very narrow bands.
The specifications for these filters are shown in Table 1. The bandwidths¹
of the individual filters at -16 dB varied from 50 Hz to 115 Hz. The 24
filters formed a 1300-cycle pass band from 373 Hz to 1684 Hz. Each of the
24 pass-band filters could be switched in and out of the circuit
independently. The filter set was inserted in the system as shown in Figure 1.

The system noise was -55 dB relative to the maximum root mean square (rms)
value of the speech (integrated over .5 seconds).
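
The way a set of switched-in filters combines into pass bands can be sketched as follows. This is only an illustrative sketch under stated assumptions: it uses the -3 dB edges of filters 1 through 6 from Table 1 as a stand-in, because the -16 dB edges used in the study are not tabulated, and the function names and the `selected` configuration are hypothetical rather than taken from the report.

```python
# -3 dB band edges in Hz for filters 1-6, taken from Table 1.
# Assumption: the study measured bandwidth at -16 dB, which is not tabulated,
# so these edges understate the effective width of each filter.
FILTER_EDGES_HZ = {
    1: (379, 415),
    2: (415, 449),
    3: (450, 484),
    4: (483, 519),
    5: (519, 557),
    6: (560, 597),
}


def merge_pass_bands(selected):
    """Merge the bands of the switched-in filters into contiguous pass bands."""
    bands = sorted(FILTER_EDGES_HZ[n] for n in selected)
    merged = []
    for low, high in bands:
        if merged and low <= merged[-1][1]:
            # Overlaps or touches the previous band: extend that band.
            merged[-1] = (merged[-1][0], max(merged[-1][1], high))
        else:
            merged.append((low, high))
    return merged


def total_bandwidth(selected):
    """Total width in Hz of the merged pass bands."""
    return sum(high - low for low, high in merge_pass_bands(selected))


# Hypothetical configuration: filters 1, 2, 4, and 5 switched in; 3 and 6 out.
selected = {1, 2, 4, 5}
print(merge_pass_bands(selected))  # [(379, 449), (483, 557)]
print(total_bandwidth(selected))   # 144
```

Switching out filter 3 leaves a gap between the two merged bands, which is the kind of excised segment the study varies in number and size.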


Subjects 

The stimulus material was presented to 36 test subjects. These were
Army enlisted men under 50 years of age, with no language problem and no
previous experience in the communication field. A hearing test conducted
at the time of the experiment showed that all were in hearing category 1.


Stimulus  Material 

The  speech  material  consisted  of  18  phonetically  balanced  (PB)  word 
lists  (9)  which  had  been  recorded  by  three  speakers,  six  lists  by  each 
speaker. 


¹The bandwidth was computed for the -16 dB point to approximate the
effective bandwidth of the filter. The -3 dB or -6 dB point often
used in specifying the filter bandwidth does not take into account
the intelligibility contributed by the filter skirt beyond that point.



Table 1


FILTER SPECIFICATIONS--FREQUENCY (Hz) AS A FUNCTION OF LOSS IN dB


                               Center
Filter    -30 dB    -3 dB     Frequency    -3 dB    -30 dB

   1        364      379         398        415       434
   2        402      415         436        449       466
   3        438      450         470        484       499
   4        469      483         505        519       538
   5        504      519         533        557       579
   6        545      560         583        597       618
   7        583      598         624        644       666
   8        627      644         668        688       714
   9        674      690         712        734       755
  10        718      734         755        779       802
  11        763      779         813        827       850
  12        813      830         840        877       901
  13        860      878         904        927       955
  14        913      926         941        976      1002
  15        958      978        1010       1033      1056
  16       1016     1033        1049       1087      1116
  17       1066     1089        1123       1153      1182
  18       1132     1153        1191       1216      1247
  19       1196     1218        1257       1282      1293
  20       1260     1283        1323       1352      1384
  21       1330     1353        1370       1426      1458
  22       1404     1426        1444       1500      1512
  23       1472     1504        1540       1575      1614
  24       1550     1579        1598       1662      1695

[Figure 1. Block diagram of equipment used in experiment.]


Experimental Procedure


The subjects were located in an Industrial Acoustics Company series
1200 chamber² in which a very low level of ambient noise was maintained.
PDR-10 headsets were employed.

The subjects were given 30 hours of training over a period of a week
(6 hours a day for 5 days) in listening to PB word lists uttered by the
three speakers. The materials had been subjected to filtering similar to
that used in the experiment proper and mixed with noise.

Following the training, the subjects started the experimental sessions.
These consisted of three half-hour sessions on each of three days. Each
half-hour experimental session was followed by a one-hour rest period.
Six experimental conditions were presented in each experimental session.
Thus, over the nine sessions, 54 experimental conditions were presented--
18 filter conditions under each of 3 noise conditions. Figure 2 presents
the filter conditions. The noise source was a Grason-Stadler² noise
generator set for "speech" shaping. Noise Condition 1 was zero noise
from the noise generator; Noise Condition 2 mixed in noise at 23 dB below
the maximum rms speech amplitude (integrated over .3 seconds). Noise
Condition 3 mixed in noise at 15 dB below the maximum rms speech amplitude
(integrated over .3 seconds).
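
The noise levels above are stated in dB relative to the rms speech amplitude. The following minimal sketch shows the standard conversion from "dB below" to a linear amplitude ratio, together with an rms calculation over a window of samples. The function names are hypothetical, and nothing here describes how the original equipment was actually set.

```python
import math


def rms(samples):
    """Root mean square of a window of amplitude samples."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def noise_amplitude_ratio(db_below_speech):
    """Noise-to-speech amplitude ratio for noise set a given number of dB below speech."""
    # Amplitude (not power) ratio, hence the divisor of 20.
    return 10 ** (-db_below_speech / 20)


# Noise Conditions 2 and 3 as described in the text.
for condition, db in ((2, 23), (3, 15)):
    print(condition, round(noise_amplitude_ratio(db), 3))
# Condition 2: about 0.071 of the speech rms amplitude
# Condition 3: about 0.178 of the speech rms amplitude
```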


Experimental  Treatment 

The design was such that all subjects, talkers, and word lists were
associated with each of the experimental conditions. The subjects had been
instructed to respond to each stimulus word regardless of how unsure they
were; and except for rare instances, a response was made to each word.

The data were reduced to the mean intelligibility values for each of
the 54 experimental conditions. Intelligibility was also computed by use
of the Articulation Index, an objective measure based on [measured] pitch
and other physical dimensions of the speech sound. An analysis of variance
was made for the main effects and interactions of days, sets of experimental
conditions, session, speaker, filter conditions, and noise conditions.


²Commercial names are used only in the interest of precision in describing
the experimental procedure. Their use does not constitute indorsement
by the Army or by BESRL.


Figure 2. Pass bands of the 18 filter configurations, measured at the -16 dB point. (Gaps in the lines show the
segments eliminated.) The frequency axis runs from .373 kHz to 1.684 kHz, with one line for each filter
configuration, 1 through 18.


RESULTS 


Intelligibility values for the 18 filter configurations at the
three noise levels are shown in Table 2 in comparison with the Articulation
Index. Table 3 shows the analysis of variance results.

A slight but significant improvement was found over the three days of
testing even though the experimental sessions had been preceded by a week
of training. This improvement, however, did not affect the results, as
each of the experimental conditions was replicated 12 times on each of
the three days. As anticipated, the filter and noise factors were
statistically significant beyond the .01 level. Blocks of experimental
conditions, periods, and speakers produced non-significant F ratios, and the
interactions of days by filter conditions and days by noise conditions
showed a probability of occurrence between .10 and .05. The filter-by-noise
level interaction was not significant, although there was a significant
change in the relationship between bandwidth and intelligibility as
a function of noise level, as discussed below.

Intelligibility produced by the different filter configurations was
also compared to bandwidth and to the Articulation Index (Table 2). Each
of the filter configurations used was fairly representative of the total
1300-cycle band. The filter configurations were designed to have the same
average Articulation Index per cycle as the total spectrum in order to
maintain a linear relationship between Articulation Index and bandwidth
(Figure 3) and thus avoid confounding the variation in intelligibility
emanating from two sources, amount of spectrum and use of spectra
concentrated in particular areas of the spectrum. This design was adopted even
though for the area of the spectrum used in the study the likelihood of
confounding was not critical. Loss in articulation for the upper part of
the spectrum area as compared to the lower part was not great, as reflected
by the values for configurations 16 and 18 in Figure 3. Even though the
pass bands were concentrated in the upper and lower parts of the spectrum,
their values lie nearly on the same line as the values for the distributed
configurations.

Figures 4, 5, and 6 show the intelligibility data for the 18 filter
configurations plotted for each noise level. The ordinate shows percent
intelligibility, the abscissa the total bandwidth of the filter
configuration. The data for each noise level were fitted by a parabola of the
form $Y = A + BX + CX^{2}$ (10). Sixteen of the 18 configurations were included
in the array fitted. The data points for configurations 16 and 18 were
left out because these configurations were not distributed samplings of the
available spectrum. Configuration 18 included the lower 518 cycles of the
spectrum and configuration 16 the upper 815 cycles.
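
A least-squares parabola of this form can be fitted as sketched below. This is a generic normal-equations fit written for illustration, not necessarily the procedure of Reference 10. It uses a subset of the Noise Level 1 entries from Table 2 purely as sample input, with bandwidth rescaled to kilocycles for numerical stability, so the resulting coefficients are not the report's fitted values. The function name is hypothetical.

```python
def fit_parabola(xs, ys):
    """Least-squares fit of y = a + b*x + c*x**2; returns (a, b, c)."""
    # Power sums that make up the normal equations.
    s = [sum(x ** k for x in xs) for k in range(5)]
    t = [sum(y * x ** k for x, y in zip(xs, ys)) for k in range(3)]
    # Augmented 3x3 system: rows are the equations for a, b, c.
    m = [[s[i + j] for j in range(3)] + [t[i]] for i in range(3)]
    # Gaussian elimination with partial pivoting.
    for col in range(3):
        pivot = max(range(col, 3), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, 3):
            f = m[r][col] / m[col][col]
            for k in range(col, 4):
                m[r][k] -= f * m[col][k]
    # Back substitution.
    coef = [0.0, 0.0, 0.0]
    for i in (2, 1, 0):
        acc = m[i][3] - sum(m[i][j] * coef[j] for j in range(i + 1, 3))
        coef[i] = acc / m[i][i]
    return tuple(coef)


# Sample input: (bandwidth in cycles, obtained percent correct) for a subset
# of the distributed configurations at Noise Level 1 in Table 2.
points = [
    (931, 59), (917, 58), (1028, 57), (1022, 55), (903, 55), (906, 55),
    (750, 51), (739, 50), (688, 46), (667, 46), (577, 43), (478, 38),
    (600, 37), (491, 29),
]
xs = [bw / 1000.0 for bw, _ in points]  # kilocycles, to keep the sums well scaled
ys = [pct for _, pct in points]
a, b, c = fit_parabola(xs, ys)
print(a, b, c)  # a negative c indicates a curve that flattens at wide bandwidths
```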


Table 2


COMPARISON OF EXPERIMENTALLY OBTAINED INTELLIGIBILITIES
WITH THOSE COMPUTED BY USE OF THE ARTICULATION INDEX AT
THREE NOISE LEVELS AT GIVEN BANDWIDTHS


                      Noise Level 1         Noise Level 2         Noise Level 3
Filter    Band-                  AI                    AI                    AI
Config.   width    Obtained   Computed   Obtained   Computed   Obtained   Computed

   1      1311        58         68         51         58         37         30
   2       931        59         44         43         33         29         18
   3       917        58         42         43         33         29         18
   4      1028        57         50         49         38         35         21
   5      1022        55         50         44         38         30         21
   6       903        55         42         39         33         29         18
   7       906        55         42         43         33         30         18
   8       750        51         35         37         27         26         15
   9       817        50         40         40         30         29         16
  10       739        50         35         37         26         25         14
  11       688        46         32         40         25         25         14
  12       667        46         30         40         24         26         12
  13       577        43         25         26         18         15         11
  14       478        38         20         30         15         19          9
  15       600        37         27         30         20         20         11
  16       815        34         35         18         26         11         14
  17       491        29         18         20         15         14          9
  18       518        27         22         22         17         14         10


Table 3


ANALYSIS OF VARIANCE RESULTS


                          Degrees of      Sum of         Mean
Source                     Freedom        Squares       Square       F Ratio

Between Subjects              35         14,135.0        403.8

Within:
  Days                         2          1,210.3        605.1        37.12*
  Sets                         2             21.6         10.8          .66
  Error 1                     68          1,106.1         16.3

  Periods                      2            484.4        242.2          .57
  Speakers                     2          1,100.7        550.3         1.29
  Pd x Spk                     4          3,270.3        817.6         1.91
  Error 2                      9          3,834.2        426.0

  Filters                     17          5,873.9        345.5         9.79*
  Noise Levels                 2         41,161.6     20,580.8       583.52*
  F x N                       34            738.1         21.7          .61
  Days x Filters              34          1,385.5         40.8         1.16
  Days x Noise Levels          4            304.2         76.1         2.16
  Error 3                   1728         60,955.6        35.27

Total                       1943        135,581.4

*Significant at the .01 level.
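
The F ratios in Table 3 can be cross-checked by dividing each mean square by an error mean square. The sketch below does this for several rows. The pairing of effects with error terms (Days and Sets against Error 1, Periods and Speakers against Error 2, the remainder against Error 3) is an assumption inferred from the layout of the table, not something the text states, and the names used are hypothetical.

```python
# (effect mean square, assumed error mean square, F ratio as printed in Table 3)
ROWS = {
    "Days": (605.1, 16.3, 37.12),
    "Sets": (10.8, 16.3, 0.66),
    "Periods": (242.2, 426.0, 0.57),
    "Speakers": (550.3, 426.0, 1.29),
    "Filters": (345.5, 35.27, 9.79),
    "Noise Levels": (20580.8, 35.27, 583.52),
    "Days x Noise Levels": (76.1, 35.27, 2.16),
}


def f_ratio(ms_effect, ms_error):
    """F ratio as the effect mean square over the error mean square."""
    return ms_effect / ms_error


for source, (ms_effect, ms_error, printed) in ROWS.items():
    print(f"{source:22s} computed {f_ratio(ms_effect, ms_error):8.2f}  printed {printed:8.2f}")

# Total degrees of freedom: 36 subjects x 54 conditions, less one.
SUBJECTS, CONDITIONS = 36, 54
total_df = SUBJECTS * CONDITIONS - 1
print(total_df)  # 1943
```

The computed ratios agree with the printed values to within rounding, which supports the assumed pairing of effects and error terms.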


Figure 3. Relationship between Articulation Index and total bandwidth.
(The numbers along the line designate filter configurations.)


Figure 4. Intelligibility and Articulation Indexes for the 18 filter configurations under
Noise Condition 1. (The figures along the lines designate filter configurations.)


Figure 5. Obtained intelligibility for the 18 filter configurations under
Noise Condition 2.


Figure 6. Obtained intelligibility for the 18 filter configurations under
Noise Condition 3.


(In Figures 4 through 6 the ordinate is percent PB words correct and the abscissa is
bandwidth, 500 to 1300 cycles.)



The redundancy in the speech spectrum is seen as the curvature
exhibited by the data in Figures 4, 5, and 6. In Figure 4, which presents
the data for the lowest noise level, intelligibility reaches a
maximum of 58 percent for the full spectrum of 1300 cycles, but appears to have
reached an asymptote at about 1100 cycles; even at 1000 cycles the fitted
curve does not show much loss. Also, the empirically obtained values for
filter configurations 2, 3, and 4 under Noise Condition 1 were not
significantly lower in intelligibility than that for configuration 1 (t-tests
gave probabilities between .70 and .80). Figures 5 and 6, which present
the data for increased levels of noise, show progressively less curvature.
The curvature component for Noise Condition 1 was significant at the .001
level. For Noise Condition 2, degree of curvilinearity was less, reaching
significance only at the .05 level. The curvature for Noise Condition 3
appears slight and does not reach significance at the .05 level. From
these comparisons, it appears that there is in fact an interaction between
noise level and filter configuration which was not apparent from the
general interaction test.

Configurations 16 and 18, examples of massed bandwidth, have an
Articulation Index and bandwidth similar to those of configurations 9 and
14, in which bandwidth is distributed over the spectrum. The distributed
configurations 9 and 14 produced significantly higher intelligibility than
the massed configurations (Table 4). The relationship held for all three
noise levels. Also, distributed configuration 4, which had a bandwidth
substantially less than that of massed configuration 1, produced almost
the same level of intelligibility as configuration 1 at each of the three
noise levels.
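
The size of the advantage for distributed over massed configurations can be read directly from the obtained values in Table 2. The short sketch below subtracts them for the pairs compared in Table 4. It is only an arithmetic illustration of the comparison, with hypothetical names, and does not reproduce the t-tests themselves.

```python
# Obtained percent PB words correct at Noise Levels 1, 2, and 3 (Table 2).
OBTAINED = {
    9: (50, 40, 29),   # distributed
    14: (38, 30, 19),  # distributed
    16: (34, 18, 11),  # massed, upper part of the spectrum
    17: (29, 20, 14),  # two bands at the ends, large gap in the center
    18: (27, 22, 14),  # massed, lower part of the spectrum
}

# Distributed versus massed pairs compared in Table 4.
PAIRS = [(9, 16), (14, 17), (14, 18)]


def advantage(distributed, massed):
    """Percentage-point advantage of the distributed configuration per noise level."""
    return tuple(d - m for d, m in zip(OBTAINED[distributed], OBTAINED[massed]))


for distributed, massed in PAIRS:
    print(f"{distributed} vs {massed}: {advantage(distributed, massed)}")
# 9 vs 16: (16, 22, 18)
# 14 vs 17: (9, 10, 5)
# 14 vs 18: (11, 8, 5)
```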

Using the Articulation Index computed for each filter configuration
and referring to the typical relationship between Articulation Index and
intelligibility of PB words (Figure 7 in Reference 4), the expected
intelligibility was computed for the filter configurations and plotted (the line
of dashes in Figure 4). The values approximate a straight line. With the
computed intelligibility as a reference, the experimentally obtained
intelligibility values for the configurations with undistributed bandwidths
(points 16 and 18) approximated what would be expected for this amount of
bandwidth. The configurations with distributed bandwidths produced
comparatively higher intelligibility. There seems to be no apparent reason
for the discrepancy between the obtained and computed intelligibility for
configuration 1.

The configurations which were the poorest samplings of the spectrum,
that is, the least well distributed (16, 17, and 18), yielded the lowest
intelligibility. As shown in Figure 2, they left the largest areas of the
spectrum unsampled. Configurations 16 and 18, as has been noted, were each
composed of a single pass band. Configuration 17 consisted of two pass
bands, one at each end of the spectrum with a gap of 820 cycles in the center.


Table 4


RESULTS OF t-TEST COMPARISONS OF SELECTION MEANS


Filter                     Noise Level Condition
Configurations           1           2           3

  1 vs  4               .72         .59         .66
  9 vs 16              5.65**      8.40**      6.66**
 14 vs 17              4.09**      3.64**      1.47**
 14 vs 18              4.28**      3.20**      1.24*

 *Significant at the .05 level
**Significant at the .01 level


CONCLUSIONS 

Segments of the speech spectrum may be excised under conditions of
low noise without incurring a proportionate reduction in intelligibility.
For the samplings in the study, there does not seem to be a critical size
of segments excised, except for configuration 17, which left a large gap
in the center of the spectrum. The other configurations were not
differentially affected by size of segments excised. In Figures 4, 5, and 6,
the data points for configurations with approximately equivalent
bandwidths fall close together. Note the cluster formed by configurations
2, 3, 6, and 7, configurations differing in number of samples and size
of segments excised. Thus, it appears that (for the spectrum used here)
bands of 200 cycles or more may be excised with no greater loss than
would result from excising an equivalent amount in a number of smaller
segments. It appears that substantial amounts of a spectrum may be excised
to eliminate bands of interference or to use the resulting interstices as
channel space for other transmissions.

The effect of noise is not only to reduce the level of intelligibility
but also (as may be seen by comparing the curvatures in Figures 4, 5, and
6) to change the shape of the function relating intelligibility to
bandwidth. As the signal-to-noise ratio becomes less favorable, the
redundancy decreases. However, speech from which segments of the spectrum have
been excised appears to be just as resistant to broad-band noise as is
continuous spectrum speech, as indicated by the fact that the differences
in intelligibility between experimentally obtained and computed
intelligibilities are of about the same magnitude for each of the noise levels.
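
The comparison of obtained and computed intelligibilities across noise levels can be illustrated with the Table 2 entries. The sketch below averages the obtained-minus-computed difference at each noise level over a subset of the distributed configurations. The choice of subset and the use of a simple mean are assumptions made for illustration, and the names are hypothetical; this is not the report's analysis.

```python
# (obtained, computed) percent correct at Noise Levels 1, 2, 3 from Table 2
# for a subset of the distributed configurations.
TABLE_2 = {
    2: ((59, 44), (43, 33), (29, 18)),
    3: ((58, 42), (43, 33), (29, 18)),
    4: ((57, 50), (49, 38), (35, 21)),
    5: ((55, 50), (44, 38), (30, 21)),
    6: ((55, 42), (39, 33), (29, 18)),
    7: ((55, 42), (43, 33), (30, 18)),
    8: ((51, 35), (37, 27), (26, 15)),
    10: ((50, 35), (37, 26), (25, 14)),
    11: ((46, 32), (40, 25), (25, 14)),
    12: ((46, 30), (40, 24), (26, 12)),
    13: ((43, 25), (26, 18), (15, 11)),
    14: ((38, 20), (30, 15), (19, 9)),
    15: ((37, 27), (30, 20), (20, 11)),
}


def mean_excess(level_index):
    """Mean of obtained minus computed intelligibility at one noise level (0-based)."""
    diffs = [rows[level_index][0] - rows[level_index][1] for rows in TABLE_2.values()]
    return sum(diffs) / len(diffs)


for level in range(3):
    print(f"Noise Level {level + 1}: mean excess {mean_excess(level):.1f} points")
```

For this subset the mean excess is positive at every noise level and differs by only a few percentage points between levels.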


It appears that intelligibility is not a simple function of amount
of bandwidth and its position in the speech spectrum, but depends also
on how the spectrum is sampled. A configuration which samples across
the entire available spectrum is more efficient than one massed in a
single area, even when the bandwidth is massed in the richer
information-bearing portion of the spectrum.



LITERATURE  CITED 


1.  Licklider, J. C. R. and G. A. Miller. The perception of speech.
    In S. S. Stevens (Ed.) The Handbook of Experimental Psychology.
    New York: John Wiley and Sons, Inc., 1951. Ch. 26, Pp. 1040-1074.

2.  Fletcher, Harvey. Speech and hearing in communication. Princeton,
    N. J.: D. Van Nostrand Company, Inc., 1953. Ch. Id, Pp. 418-11.

3.  French, N. R. and J. C. Steinberg. Factors governing the
    intelligibility of speech sounds. Journal of the Acoustical Society
    of America, 1947, 19, 90-119.

4.  Kryter, K. D., Gail Flanagan, and Carl Williams. A test of the
    20-band and octave-band methods of computing the Articulation Index.
    Contract AF 19 (604)-4061. Cambridge, Mass.: Bolt Beranek and
    Newman, Inc., 1961.

5.  Pollack, Irwin. Effects of high pass and low pass filtering on the
    intelligibility of speech in noise. Journal of the Acoustical
    Society of America, 1948, 20, 259-266.

6.  Beranek, Leo L. The design of speech communication systems.
    Proceedings of the Institute of Radio Engineers, 1947, 35, 880-890.

7.  Kryter, Karl D. Speech bandwidth compression through spectrum
    selection. Journal of the Acoustical Society of America, 1960,
    32, 547-556.

8.  Schroeder, M. R. VOCODERS: Analysis and synthesis of speech.
    Proceedings of IEEE, 1966, 54, 723-734.

9.  Egan, James P. Articulation testing methods. Laryngoscope, 1948,
    58, 955-991.

10. Snedecor, George W. Statistical Methods. Ames, Iowa: The Iowa
    State College Press, 1940. Ch. 14, Pp. 315-317.



Unclassified
Security Classification


DOCUMENT CONTROL DATA - R&D


1. ORIGINATING ACTIVITY (Corporate author)
   U. S. Army Behavioral Science Research Laboratory
   OCRD, Washington, D. C.

2a. REPORT SECURITY CLASSIFICATION
   Unclassified

2b. GROUP

3. REPORT TITLE
   EFFECTS OF SPECTRUM SAMPLING ON SPEECH INTELLIGIBILITY

4. DESCRIPTIVE NOTES (Type of report and inclusive dates)

5. AUTHOR(S) (First name, middle initial, last name)
   Anthony E. Castelnovo

6. REPORT DATE
   March 1969

7a. TOTAL NO. OF PAGES

7b. NO. OF REFS

8a. CONTRACT OR GRANT NO.

 b. PROJECT NO.
   Army R&D Proj. No. 2Q062106A723

 c. Monitor Performance

9a. ORIGINATOR'S REPORT NUMBER(S)
   Technical Research Note 207

9b. OTHER REPORT NO(S) (Any other numbers that may be assigned this report)

10. DISTRIBUTION STATEMENT
   This document has been approved for public release and sale; its distribution is
   unlimited.

12. SPONSORING MILITARY ACTIVITY
   Office, Chief of Research and Development,
   DA, Washington, D. C.

13. ABSTRACT
   In controlled laboratory experimentation, the MONITOR PERFORMANCE Work Unit,
   USA BESRL, studies the effects of factors associated with the signal, the monitoring
   task, the environment, and the individual in a variety of combinations. One segment
   of work unit effort is concerned with variables affecting the intelligibility of audio
   signals and speech, with present emphasis on the area of spectrum selection and binaural
   listening. The current publication reports on a study of the effect on the
   intelligibility of phonetically balanced (PB) words of excising several narrow bands from a
   curtailed speech spectrum (1300 cycles). A spectrum composed of several discrete pass
   bands was compared to (1) the total curtailed spectrum, (2) the curtailed spectrum with
   one large segment removed from the end, and (3) the articulation predicted by the
   Articulation Index. Eighteen PB word lists uttered at three speech-to-noise ratios
   constituted the stimulus material presented to subjects through a filter system with
   selected pass bands. Results indicate that at the higher speech-to-noise ratios,
   eliminating several narrow bands from the spectrum does not result in a corresponding
   reduction in intelligibility. When the speech is 35 dB above the noise, a reduction
   of 20% or more can be made in bandwidth without noticeable reduction in intelligibility.
   As the speech-to-noise ratio is decreased, the decrement in intelligibility becomes
   more nearly proportional to the decrease in bandwidth. A given bandwidth distributed
   over a spectrum area is more effective than an equivalent bandwidth massed in one part
   of the spectrum. Distributed sampling of the spectrum was found to be more effective
   than would be expected from Articulation Index computations.


Unclassified
Security Classification


KEY WORDS

*Speech intelligibility
Audio signals
*Binaural listening
*Spectrum selection
Communication channel
Frequency bands
Human performance
Phonetic balance
*Filter configurations
Laboratory facilities
Noise levels
*Intelligibility values; analysis
*Communications research
*Military communications system
*Articulation Index
Signal-to-noise ratio
Statistical methods
Pass band system


